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Regulatory sequences of the mouse villin gene - Use in transgenesis. 

The invention relates to regulatory sequences of the mouse villin gene that efficiently 
drive transgenic expression in immature and differentiated epithelial cells of the 
intestine and uro-genital tracts. 

Villin is a cytoskeletal protein which is mainly produced in epithelial cells that develop 
a brush-border responsible for absorption as in the digestive apparatus (epithelial cells 
of the large and small iniestine) and in the urogenital tract (epithelial cells of the kidney 
proximal tubules). Because it is expressed in the proliferative stem cells of the intestinal 
crypts (16,17), it is believed to be an early marker for commited intestinal cells. The 
multiple levels of regulation control villin gene activity during mouse embryogenesis 
(18-20) and account for the strict pattern of tissue-specific expression observed in 
adults. Moreover, the expression of the villin gene in intestinal epithelial cells is 
conspicuously maintained in their correspondent carcinomas (21-24). 
The specific expression pattern of villin suggests that it is an appropriate candidate for 
the characterization of regulatory sequences that could allow targeting of heterologous 
genes into a selected population of cells in the mouse digestive tract. 
In order to design new constructs and systems enabling the targeted expression of genes 
in epithelial cells of intestinal or urogenital tracts, the inventors have investigated the 
underlying molecular mechanisms and particularly those responsible for the restricted 
tissue specificity of the expression of villm. 

The invention therefore provides new regulatory sequences encompassing cis-acting 
elements involved in the regulation of the transcription and of the expression of the 
murine villin gene. 

The invention also relates to recombinant constructs comprising said regulatory 
sequences, for the control of the targeted expression of determined nucleic acid 
sequences so-called (heterologous sequences or also transgenes), in cells or tissues 
originating from the intestinal mucosa. 
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A further object of the invention is to provide cells, tissues or organisms including 
animals, expressing said determined nucleic acid sequences in a targeted manner 
Transgenic mice are routinely used to study the molecular and cellular basis of normal 
and pathological states in intestinal mucosa (1-5). The major limitation regarding the 
targeting of exogenous transgenes in this tissue, is that the epithelium of the mouse 
intestinal mucosa is renewed every 2-5 days (6-8). The epithelial cells arise from 
multipotent stem cells functionally anchored at the base (more precisely in the lower 
third) of the epihelium's proliferative compartment, the crypts of Lieberkiihn. These 
crypts display a monoclonal organization since they are each derived from a single 
progenitor cell (9). Descendants of stem cells multiply in the middle portion of each 
crypt (10), and gradually differentiate into four principal cell types. In the small 
intestine, absorptive enterocytes (constituting >80% of the epithelial cells), mucus- 
producing globlet cells and enteroendocrine cells migrate upward from the crypts to the 
apex of surrounding villi (whose colonic counterparts are hexagonal shaped cuffs) (II), 
where they become apoptotic and are exfoliated into the gut lumen (12). In contrast, 
antimicrobial peptides-secreting Paneth cells migrate to the bottom of the crypts, where 
they reside for about 20 days (13), 

Given the remarkable protective effect of this epithelium, it is not unexpected that most 
previous studies aiming to induce neoplastic transformation in intestinal mucosa of 
transgenic mice have failed (14, 15). In these prior reports* the use of promoter 
sequences which direct oncogenes in non proliferating enterocytes located in the upper 
third of crypts produce only minor phenotypie abnormalities without tumorigenic 
consequences in the gut epithelium, suggesting that the residence time of these villus- 
associated cells may not be sufficient for the oncogenes to exert their effects. 
Purthemore this suggests that transgenic mouse models of neoplasia may require an 
efficient targeting of oncogenes in crypts stem cells or their immediate descendants. 
With this goal in mind, the human villin gene has been isolated and characterized (25). 
A 2 kb 5 -flanking region has been found to contain sufficient regulatory elements to 
promote tissue-specific expression of a reporter gene in intestinal and renal cell lines 
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(26). In transgenic mice, this regulatory region is able to drive the expression of the 
human Ha-ras oncogene in the tissues in which the endogenous gene is actively 
transcribed. However low levels of expression were observed that did not trigger 
malignant tissue appearance into the gut of these animals. 

The invention provides new means for the targeted expression of heterologous 
sequences in cells originating from intestinal and/or urogenital tracts. Advantageously, 
the invention encompasses but is nor limited to nucleotide sequences that should 
overcome at least in part, some of the deficiencies of the previously described gene 
constructs prepared with regulatory sequences of the human villin gene ; especially 
having recourse to regulatory elements promoting the expression of the murine villin 
gene, the inventors have designed new DNA constructs that may improve the efficiency 
of the targeted expression of heterologous genes in transgenic animals, with respect to 
the results obtained when said heterologous genes are placed under the control of the 
human villin regulatory elements contained in the above disclosed 2 kb sequence. 
The inventors have analysed an extended genomic region of the mouse villin gene with 
the goal of mapping elements localized at the 5 T and/or 3' ends and possibly involved in 
promoting high levels of targeted expression of heterologous sequences in epithelial 
stem or differentiated cells, specifically originating from the intestinal mucosa. 
As a result, the inventors have identified a transcriptional regulatory region that enables 
efficient targeted tissue-specific or cell-specific expression, reproducing the expression 
pattern of the endogenous villin gene in mice, including expression in the crypt stem 
cells of the colon and in differencitated cells. 

In accordance with the invention the identified villin regulatory region provides a 
molecular tool for the establishment of new cell lines, including new immortalized cell 
lines, particularly epithelial cell lines originating from intestinal tissue. It also provides 
means appropriate for the preparation of transgenic organisms, especially transgenic 
animals, including transgenic mice, enabling the targeted expression of determined 
nucleotide sequences. 
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The invention therefore discloses appropriate tools for the study of pathological states 
related to a dysfunction of gene expression pattern or for The treatment or prevention of 
said pathological states, for example to enable targeted expression of a gene acting as 
repairer gene in order to compensate for the dysfunction of an endogenous gene, or to 
add a novel function in cells or to suppress a determined function or state, or its 
consequences. 

The study of pathological states can benefit from the establishment of transgenic model 
animals wherein induction of a pathological state is obtained in relation with the 
targeted expression of genes. It also relates to the study of rescue of pathological states. 
The invention thus relates to a nucleotide sequence derived from the 5 ? sequence of the 
murine villin gene, having a size of 9 kb on an agarose gel, or a fragment thereof, 
comprising nucleotide elements having a cis-regulatory activity that promotes the 
transcription of the murine villin gene. 

In accordance with the invention, the expression "nucleotide sequence" designates any 
type of nucleotide sequence, especially DNA> whatever its origin, including genomic, 
cloned, amplified, recombinant or synthetic sequences. 

The term "derived from" characterizes the fact that the sequence is defined with 
reference to the isolated endogenous sequence of the murine villin gene. Said nucleotide 
sequence derived from the murine villin gene nevertheless encompasses sequences 
obtained without having direct recourse to the native isolated gene. Furthermore, the 
structural and/or functional characterizing features of the nucleotide sequence may 
correspond to their native counterpart in the murine villin gene or may be modified as a 
result of mutation, deletion, truncation, or addition of nucleotides or nucleotide 
fragments provided its function of promoting the transcription and/or expression of the 
villin gene is not substantially affected 

The sizes of the nucleotide sequences which are indicated in the present patent 
application may vary: indeed, the indicated sizes correspond to the size deduced from 
the hand visualized on an agarose gel 7 in conditions corresponding to those given in the 
examples. 



WO 00/34492 PCT/EP98/08OO9 



Especially, a variation of the actual size of within the range of 20% to 5% especially 
around 15 or 10 % of the sequence, is encompassed within the given size indication. 
A nucleotide sequence of the invention is a sequence having a cis-regulatory activity 
that promotes the transcription of the murine villin gene thus enabling the transcription 
5 ending in the production of mRNA and subsequent expression of villin protein, in 
intestine epithelial cells, Conditions appropriate to test said regulatory activity are 
described in the examples. 

Where necessary, access to the genomic DNA of the murine villin gene is enabled 
through various available libraries. Clones obtained from a genomic library can indeed 
10 be used to isolate the nucleotide sequences of the invention, provided said clones 
■J contain sequences comprising a fragment having a si2e of at least 9 kb upstream from 
in the translation initiation codon. Said fragment can be obtained from one clone or from 
: j overlapping genomic sequences contained in several clones. Probes derived from the 
HJ cDNA of the villin gene can be used to identify appropriate clones in a genomic library 
^15 of murine genes. Especially, such probes can be designed around the translation 
intiation codon and have advantageously at least 100 nucleotides, including sequences 
Ifl having 200 to 500 nucleotides, or more. 

;S Such probes can also be designed starting from the sequence SEQ ID NO: ] and can be 
p synthesized or obtained by amplification. They can be derived from the 5' sequence 
20 upstream of the translation initiation codon. 

According to specific embodiments, the invention relates to various nucleotide 
sequences derived from said 9 kb sequence. Especially, the invention relates to: 

- a nucleotide sequence which is the sequence extending 5,5 kb upstream and 3.5 kb 
downstream from the transcription initiation site of the murine villin gene ; 

25 - a nucleotide sequence which is identified as SEQ ID NO: 1 and which is represented 
on Figure 6; 

- a nucleotide sequence which comprises or which is the nucleotide fragment extending 
from the HS I to the HS IV Dnase I-hypersensitive sites, 
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The DNAsel hypersensitive sites can be mapped on a genomic DNA corresponding to 
the mouse villixi gene with the probes described on figure 2. 

- a sequence which comprises or which is the nucleotide fragment extending from the 
HS IV Dnasel-hypersensitive site to the translation initiation codon of the murine villin 
gene; 

- a sequence which comprises or which is the nucleotide fragment extending from the 
nucleotide at around position -100 upstream from the transcription initiation site, to the 
translation initiation codon; 

- a sequence which comprises or which is the nucleotide fragment extending 3.5 kb 
upstream from the transcription initiation site to the transcription initiation site and 
further comprises the translation initiation codon, and possibly the sequence of exon 1 
and the sequence of exon 2 starting 5 1 from the ATG codon; 

- a sequence which comprises or which is the nucleotide fragment extending from 
around the nucleotide at position -480 from the transcription initiation sequence, to the 
translation initiation codon; 

- a sequence which comprises or which is the sequence extending 3.5 kb upstream from 
the transcription initiation site to the translation initiation codon, provided the region 
corresponding to intron 1 5 located between said sites is deleted, or deleted in part; 

- a sequence which comprises or which is derived from the nucleotide sequence of the 
murine villin gene having a size of 9 kb on an agarose gel and extending 3.5 kb 
upstream from the transcription initiation site and 5 J kb downstream from said site, or a 
fragment thereof, said nucleotide sequence or fragment thereof having a regulatory 
activity on the transcription of the murine villin gene and/or on the level of expression 
of the murine villin gene in intestine cells and/or in transgenic mice. 

The transcription regulatory activity of the murine villin gene is cited as a reference for 
the identification of nucleotide sequences encompassed within the scope of the 
invention. The invention furthermore relates to a nucleotide sequence which is derived 
from said 9 kb sequence of the murine villin gene and which enables the transcription of 
a heterologous sequence, with respect to this gene. 
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According to another embodiment, the invention relates to a sequence having a 
regulatory activity on the transcription of a villin gene, which sequence is a non-human 
sequence and which hybridizes in stringent conditions with at least one of the above 
nucleotide sequences. 

Especially variant nucleotide sequences can be obtained from other animals of the 
Vertebrates or Invertebrates groups, especially from mammals, or from birds especially 
chicken / or from fishes* Referring to Invertebrates nucleotide sequences can be otained 
from Drosophila or from C elgg ans. 

Although the above defined sequences have been shown as being able to promote 
efficient transcription and expression of heterologous sequences in epithelial cells of 
intestinal origin or in transgenic mice, their structural variations may affect the 
efficiency of their regulatory activity, either with respect to the tissue - or cell- 
specificity of this activity, or with respect to the expression level observed. 
They can therefore be used to promote the targeted transcription and expression of 
genes (or more generally any nucleotide sequence of interest) in epithelial cells of the 
intestinal or uro-genital tracts, either in stem cells or in differentiated cells. 
The invention thus relates to a recombinant nucleotide sequence which comprises a first 
nucleotide sequence and a second nucleotide sequence for which a tissue specific 
targeted expression in epithelial intestine cells is sought. 

In a particular embodiment of the invention, the second nucleotide sequence is a 
sequence encoding a determined polypeptide, protein or peptide, all designated hereafter 
by the term "polypeptide". 

The second nucleotide sequence may also be a sequence of therapeutic interest such as 
tumor suppressor gene, a functional inhibitor of a gene, an antisense sequence, an 
oncogene, an immobilizing gene, a normal gene for the restauration of a function, or 
more generally any sequence the targeted expression of which in epithelial cells of 
intestinal origin may present an interest, for instance in processes for preventing, 
controlling or curing pathological states including those states related to the 
development of tumors. 
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In a particular embodiment of the invention, the second nucleotide sequence codes for 
an antigen or an immunoglobulin or for fragments thereof, including variable chains or 
immunoglobulins. 

In another embodiment of the invention, the second nucleotide sequence is an oncogene, 
A recombinant sequence of the invention comprising an oncogene may be used for 
studies relating to carcinogenesis especially in animal models expressing said 
recombinant sequence. 

According to a specific embodiment of the invention the second nucleotide sequence 
which is placed under ihe control of the regulatoiy sequences of murine villin gene, is 
further placed under the control of an inducible system, for example the 
Tetracycline/Doxycycline mediated temporal control of gene expression in transgenic 
mice (Kistner A- et al 7 1996, PNAS, 22, 10933-10938). 

The invention also concerns vectors containing the nucleotide sequences of the 
invention especially plasmids or cosmids, appropriate for the transfer and/or expression 
of these sequences in ceils or transgenic animals. 

The invention also concerns recombinant epithelial cells comprising a recombinant 
sequence as described above. 

These epithelial cells encompass stem cells, especially crypt stem cells of the intestinal 

mucosa, or differenciated cells, especially commited intestinal cells. 

The invention further relates to an epithelial cell originating from the kidney proximal 

tubules recombined with the nucleotide sequences of the invention. 

The epithelial cells of the invention can be immortalized cells, especially as a result of 

the expression of an immortalizing gene (e.g., AgT (tsA58)) the expression of which is 

drived and targeted by the regulatory nucleotide sequence of the invention. 

The administration of the recombinant nucleotide sequence of the invention may be 

made by the available techniques including ex vivo or in vivo administration processes, 

especially by electroporation, calcium phosphate precipitation, liposomes,.. 

The invention also concerns transgenic animals obtained by transgenesis of recombinant 

nucleotide sequences of the invention. 
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These transgenic animals encompass animals from the Vertebrates or Invertebrates 
groups and are especially birds for instance chicken or fishes or non-human mammals, 
more particularly mice, invertebrates like Drosophila or Nematodes* like C. elegans can 
also be used for the preparation of transgenic animals expressing recombinant sequences 
5 of the invention under the control of regulatory sequences derived from their 
endogenous viliin genes. Appropriate processes for the preparation of the transgenic 
animals expressing in a targeted tissue specific way, recombinant nucleotide sequences, 
are disclosed in the following examples. 

Such a process for the preparation of transgenic mice, advantageously comprises the 
10 steps of: 

□ - administration of a transgene into the pronuclei of fertilized eggs of mice, 

^ - enabling the development of the recombined eggs to recover transgenic mice 

(founders) and verifying the presence of the transgene, 
ii] - if appropriate crossing the founders with non transgenic mice. 

" | 5 Further steps, including crossing between mice capable of expressing the sequences of 
the invention with mice expressing different sequences (e.g. sequences encoding 
;p inducible systems) in order to obtain double recombinant mice. 

f-*20 LEGENDS OF THE FIGURES 

Figure 1 : 

Determination of the transcription start site of the mouse viliin gene by primer 
extension. A, primer extension analysis was performed with mouse intestinal total KNA 
(30 >ig) and with either the end-labeled villin oligonucleotide (generating a 105- 
25 nuleotides extension product) or the end-labeled mouse intestinal fatty acid binding 
protein gene (Fabpi) oligonucleotide used as a positive control (generating a 81- 
nucleotides extension product). The size of the fragments obtained by primer extension 
is shown at the left. The unrelated sequence ladder that was run in the same gel is used 
as a size marker, B> nucleotide sequence between the transcription start site (the bold 
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adenosine designated as +1) and the initiation codon (the bold underlined ATG codon) 
of the mouse villin cDNA. Each of the splice junctions present in the intron 1 (indicated 
below) conforms to the consensus splice donor (the italique GT nucleotides) and 
acceptor (the italique AG nucleotides) patterns, described by Breathnach and Chambon 
5 (30). C ? schematic representation of the organization of the 5" -flanking region of the 
murine villin gene. The open box represents the untranslated exon and the shadowed 
box represents the first coding exon. The size of the exon and the intron is indicated. 

Figure 2 : 

DNase I-hypersensitivity in the mouse villin gene. A, a partial restriction map 
diagram of the mouse villin gene regions subcloned (-3.5 to +9S kb in respect to the 
transcription start site, indicated by an arrowhead). BamHl (B), BgUl (Bg), Eco RI (E), 
Hind IE (H) restriction sites, ATG initiation codon and the probes used to map the 
hypersensilives sites (0.5, 0,8 and L25 kb) are shown, B and C, intestine, kidney, liver 
and spleen nuclei were digested with increasing amounts of DNase I at 0 °C for 10 min 
(0, 20, 40, 80, 160 units). 10 pig of purified genomic DNA was digested with BglU. 
(panel B), and BamJR (panel C), electrophoresed and transferred to a nyion membrane. 
Hypersensitives sites were revealed by probing with a 32 P~IabeIled fragment of 0.5 kb. 
Positions of coelectrophoresed molecular weight markers are indicated at the left, and 
the hypersensitive bands are marked by arrows at the right The maps represented below 
show the position of restriction sites, the deduced DNase I-hypersensitive sites 
(indicated by arrows) and the 0.5 kb probe used. 

Figure 3 ; 

25 Transient transfection analysis of the mouse villin promoter. A, above a partial 
restriction map diagram of the mouse villin gene from 9 kb with respect io the 
translation initiation codon. Apal (A), BamWl (B), B g m (Bg), BstBll (Bs), Drdl (D), 
Ncol (N) and Xbal (X) restriction sites are shown. The schematic representation below 
shows the location of the four hypersensitive sites (MV) as well as the 5.5 kb intron 
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(represented by a cross-hatched rectangle) separating the transcription stare site 
(indicated by an arrowhead) and the translation initiation codon. B 7 diagrams of the 
various constructs generated by deletion. Different portions of the 5 '-flanking region of 
the mouse villin gene were fused with the E. coli p-galactosidase gene containing the 
nuclear localization signal (nls). C, {J-galactosidase activities resulting from transient 
transfections into CaCo2 colon cells (shadowed bars) or LLCPK1 kidney cells (open 
bars) with the reporter constructs generated (represented in panel B). Basal activity 
resulting from the promoterless pBasic plasmid was set arbitrarily at 1 * Values indicate 
the average of at least three independent transfections. 

figure 4: 

Expression pattern of the transgene, Transgene (j^Gal) specific-transcripts were 
detected by reverse transcription-PCR in a ethidium bromide containing agarose geL 
Above each lane, the different tissues tested and the controls, (+) : kidney mRNA from a 
mouse in which the (3-gaiactosidase was inserted at the villin locus (32), and (-) : 
distilled H 2 0 as a template. RT-PCR were also performed on mRNAs of the 
endogenous villin gene and the ubiquitous TFIID gene. 

Figure S : 

p-galactosidase activity in sections of small intestine and colon from transgenic 
mice* Tissues were removed from transgenic mice, fixed, and stained for p~ 
galactosidase activity with X-Gal as described, p-galactosidase activity was observed in 
the epithelial cells, both immature and differentiated, along the crypt-villus axis in the 
small intestine (panel A). Note that the differentiated cells exhibited a strong signal as 
did the vfflus-associated cells and the Paneth cells (arrows) localized to the bottom of 
the crypt (panel B). The epithelial cells of the colon were also stained (panel C) 7 
panicularly all the cells in the crypt (panel D). Bars, 100 pm (panels A and C), 40 
(panels B and D). 
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Sequence of the genomic DNA of the murine villin gene comprising cis-acting elements 
capable to promote the transcription of the murine villin gene in intestinal mucosa and 
kidney proximal tubules. The sequence comprises the transcription initiation site at 
position 3442 followed by the sequence of exon 1 containing 46 pb, the translation 
initiation codon at position 8993 7 the sequence of intron 1 extending from nucleotide 
3488 to nucleotide 898 L 

Targeted expression of the {5-galactosidase protein using regulatory sequences of the 
mouse villin gene. The data have been obtained by transgenesis. 

Figure 8 ; 

Protocols for the preparation of transgenic mice expressing determined nucleotide 
sequences under the control of the murine villin gene regulatory sequences. 

EXPERIMENTAL PROCEDURES 

A large genomic region of the mouse villin gene has been analyzed. A 9 kb regulatory 
region of the mouse villin gene {harbouring 3.5 kb upstream the transcription start she 
and 5.5 kb of the first intron) was able to promote transcription of the Lac Z reporter 
gene in small and large intestines of transgenic mice, in a transmissible manner, and 
thus efficiently directed subsequent (3-galactosidase expression in epithelial cells along 
the entire crypt-villus axis. In the kidney, the transgene was also expressed in the 
epithelial cells of the proximal tubules but is likely sensitive to the site of integration. A 
construct lacking the first intron restricted p-galactosidase expression to the small 
intestine. Thus, the 9 kb genomic region contains the necessary cis-acting elements to 
recapitulate the tissue-specific expression pattern of the endogenous vilhu gene. Hence, 
these regulatory sequences can be used to target heterologous genes in immature and 
differentiated epithelial cells of the small and/or large intestinal mucosa. 
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Here we report the analysis of tissue-specific expression of the mouse viliin gene using : 
(i) DNase I-hypersensitive sites assays, (ii) transieat-transfection assays and (iii) 

transgenic mice- 
Cell culture and ex vivo transient transfection. 

Human colon carcinoma CaCo2 cells were cultured at 37°C t 10% in C02, in Dulbecco 
modified Eagle medium supplemented with 10% fetal scrum, IX nonessential amino- 
acids and 5 mM L-glutamine. Pig Kidney proximal tubules derived-LLCPKl cells and 
canine kidney distal tubules denved-MDCK cells were cultured at 37°C, 10% C02, in 
Dulbecco modified Eagle medium supplemented with 10% fetal calf serum and 5 mM 
L-glutamine. Cells cultures, approximately 50% confluent in 60 mm-dishes containing 
serum-free medium, wre cotransfected using IS ^1 of Lipofectin reagent (Life 
Technology es> Inc.) with 5 jig of each fj-galactosidase reporter plasmid construct and S 
}ig of the control plasmid, pRSVLuc, which contains the luciferase gene under the 
control of the Rous sarcoma virus promoter. The serum-free medium was changed to 
growth culture medium 6 h after transfection, and cells were harvested 48 h later. Cell 
extracts were assayed by chemiluminescent detection of both (5-galactosJdase (Galacto- 
Light, Tropix, Inc.) and luciferase (Luciferase Assay Kit, Tropix, Inc.) activities using a 
luminometer (Berthold). (3-galactosidase activity (light units) was corrected for 
variations in transfection efficiencies as determined by luciferase activity. The volume 
of cell extracts used in the p-galactosidase and luciferase assays wre adjusted such taht 
the enzyme activity was always within the linear range of the assay. All transfections 
were repeated at least three times. Results are expressed as -fold induction over that of 
the vector without promoter* pBasic. 

Primer extension analysis. 

Total RNA was isolated from mouse intestine with RNA NOW reagent (Biogentex) 
under the conditions suggested by the supplier. For primer extension assay, 2 ng of 32 P- 
labeled oligonucleotide probe (5'-GAGTGGTGATGTTGAGAGAGCCT-3 ') 
complementary to nucleotides +81 to +103 of the murine viliin cDNA (GenBank 
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Accession No, M98454) was hybridized with 30 pg of total RNA at 60 P C (0.25 M KC1, 
10 mM Tris-HCl, pH 7.5 4 1 mM EDTA) for 90 min. Transcription with 5 U/\i\ of 
Moloney murine leukemia vims reverse transcriptase (Life Technologies, inc.) was 
carried out at 37°C for 90 mm in a 300 jjJ of a solution containing 75 mM KC1, 3 mM 
MgCl> 50 mM Tris-HCl (pH S.3), 10 mM dithiothreitol, 0.75 mM deoxynucleoside 
triphosophates, 75 p.g/ml actinomycin D and 0.3 U/^I RNasin. The primer extension 
products were separated by electrophoresis in denaturing ?% polyacrylamide gels. The 
full-length extension product (105 nucleotides) was obtained by comparison with the 
length of the comi grating sequencing reaction products, A primer extension control 
experiment was performed on the same total RNA preparation, using a 3i P-Iabeled 
oligonucleotide probe (5*-CATAGTTCTCGTTCCGGT-3 *) complementary to 
nucleotides +63 to +80 of the mouse intestinal fatty acid binding protein (I-FABP) 
cDNA and generating a 81 -nucleotide extension product (27). 

DNase I-bypersensitive $jtes analysis- 
Tissues from 30 mice were used per assay of intestine, kidney, liver and spleen. Nuclei 
preparation and DNase I digestion were performed as described (28) with minor 
modifications. Nuclei were digested without or with 20 to 160 units of DNase I (DPRF 
Wonhington) for 10 min ar 0°C. Genomic DNA was purified by three rounds of (1:1) 
phenol-chloroform extraction followed by chloroform extraction and precipitation with 
ethanol. 10 \ig of each sample was digested overnight with restriction enzyme (BamRl 
or BgUL). The DNA fragments were separated by electrophoresis on a 0.8% agarose gel 
in TAE (40 mM Tris (pH 7.2), 20 mM sodium acetate, 1 mM EDTA), transferred onto a 
charged nylon membrane (Hybond-NT, Amersham), and hybridized at 65°C overnight 
with a random-primed (rediprime, Amersham) 32 P-labeled probe. The probe, the BgUI- 
Pstl probe (0.5 kb) (as indicated in Fig. 2) was used to map the DNase I-hypersensitive 
sites in the BarnHI, Bgffl fragments. The filter was washed using (IX SSC (0.15 M 
NaCL, 15mM sodium citrate, pH 7.0), 0.1% SDS, 55°C) and exposed to film overnight 
at -70°C with an intensifying screen, 
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Plasmids construction. 

All constructs described were subcloned into the pBluescript II KS vector (Stratagene) 
with fragments isolated from a AD ASHE phage containing a 163 kb region (9 kb 
5 upstream and 73 kb downstream from the translation initiation codon) of the mouse 
villin gene (29). The pDl construct (as described in the Fig, 3B) was prepared by 
Iigating a BamW. fragment of 5 J kb (1,8 kb upstream from the ATG translation 
initiation codon of the mouse villin gene, subcloned 5' to the nuclear localization signal- 
0-galactosidase gene-SV4G polyadenylation site, using a polymerase chain reaction 
10 (PCR) strategy) at the BamHl site in a plasmid containing the 3.7 kb region of the 
mouse villin gene (immediately 5' to the L8 kb region described above). The pAl and 
JP; pA2 (containing an internal 1 kb deletion) constructs have resulted from several steps 
p based on the BstEU sites present in the 3-7 kb region described above and in a plasmid 
containing the 3.5 kb region of the mouse villin gene (immediately 5' to the 3.7 kb 
|5 region). The pCl and pC2 constructs were derived from the pAl and pA2 plasmids cut 
with Apal and re-li gated, respectively. To generate the pBl construct, a BgHJ fragment 
O (480 bp) from the 3.5 kb region described above was excised and cloned into the Kpnl 
Jjfj; site of the pCl plasmid. The pA3, pB3 and pC3 constructs correspond to the pAl, pBl 
rij and pCl deleted from the incron 1 (Fig. 3B). The sequence between the transcription 
2Ef initiation start site and the translation initiation codon, excluding the intron 1, was 
deduced from that of the murine villin cDNA (GenBank Accession No, M98454) and 
was introduced into the BgKL-NcoI sites of the pCl construct by using a dimerized 
oligodimer made of a coding-strand oligonucleotide (5*- 
GATCTCCCAGGTGGTGGCTGCCTCTTCCAGACAGGCTCGTCCAC-3*) and anon 
25 coding-strand oligonucleotide (5'- 

CATGGTGGACGAGCCTGTCTGGAAGAGGCAGCCACCACCTGGGA-S*), 
resulting in the pB3 construct. The pA3 and the pC3 constructs were derived from the 
pB3 plasmid by Iigating an Apal fragment (3.1 kb) and a BglEL fragment (480 bp) from 
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the 3.5 kb region described above, at the Apal site in the pB3 plasmid respectively. 
Subcloning steps were confirmed by DNA sequencing. 

Transgenic mice generation. 

The transgenes digested with Xhol-NoiL purified by gel elution and Elutip Columns 
(Schleiche & Schueil). The linear fragments were supended in 10 mM Tris-HCl ? pH 7.4, 
0.2 mM EDTA and were injected into the pronuclei of the fertilized eggs of the B6/D2 
mice. Mice carrying transgenes (founders) were first identified by PCR of genomic 
DNA isolated from a short segment of tail to confirm the presence of the p-galactosidase 
gene and then analyzed by Southern blotting to determine the copy number of the 
integrated transgene. Each founder animal harbored one copy of the transgene per 
genome. Small intestine, colon, kidney, stomach, liver, heart, lung, thymus, brain, 
spleen and muscle were dissected from transgenic mice* cut in small pieces, quickly 
frozen in liquid nitrogen-cooled isopentane either prepared for total RNA extraction or 
embedded in Tissue~Tek CXC/T Compound (Sakura Finetek) blocks to perform 
cryosections. 



Reverse transcriptton-PCR Analysis. 

Total RNA was isolated from mouse tissues described above, with SV Total RNA 
Isolation System (Promega) under the conditions suggested by the supplier. 20 ng of 
pd(N)$ random primer (Pharmacia) were hybridized with 2 fig of total RNA at 70°C for 
10 min in distilled water. Reverse transcription with 200 U of Moloney murine leukemia 
virus reverse transcriptase (Superscript II, Life Technologies, Inc.) was carried out at 
37°C for 90 min in a 20 yl solution of IX First Strand Buffer (Life Technologies, Inc), 10 
mM dithiothreitol, 0.5 mM deoxynucleoside triphosphates and 0.4 U/jJ RNasin. 2 pi of 
the resulting cDNAs, were amplified by PCR reaction in 50 for 40 cycles. Each cycle 
consisted of 60 sec at 94°C, 60 sec at 51°C (for transgene and viUin) and 57*C (for 
TFHD) S and 30 sec at 72°C. For the transgene primers, 5'- 
CAACTTCCTAAGATCTCC-3* coding strand and 5'. 
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ATTC AGGCTGCGC AACTGTT-3 ' non-coding strand were used, generating a 250 bp 
product. For villin amplification 5^AACTTCCTAAGATCTCC-3' coding strand 
primer and S'-GCAACAGTCGCTGGACATCACAGG-S 1 non-coding strand primers 
were used, generating a 473 bp product; for TFHD amplification 5 1 - 
5 CCACGGACAACTGCGTTGAT-3 1 coding strand primer and 5 1 - 
GGCTC AT AGCTACTGAACTG-3 * non-coding strand primer were used, generating a 
220 bp product, In all cases, one-fifth of the PCR product was run on an ethidium 
bromide containing agarose gel. 

1 0 Detection of {3-galactosidase activity. 

Cryosections (5 ym) from the tissues described above wre then dried overnight at room 

'I temperature, fixed with 3% paraformaldehyde for 5 min a washed in phosphate buffered 

IB saline and incubated in a staining solution that contained 0.4 mg of 5-bromo-4-chloro-3- 
indolyl-p-D-galactopyranoside (X-Gal) per ml, 4 mM potassium feiricyanide, 4 mM 
potassium ferrocyanide, 2 mM MgC\ 2 at 37°C for S h. After staining, the sections were 

III again washed in phosphate buffered saline, mounted and examined histologically to 

^ detect the expression of exogenous P-galactosidase. 

jfi RESULTS 

M Determination of the transcription start site. 

To determine the transcriptional start site of the mouse villin gene, total RNA was 
isolated from intestine and analysed by primer extension assay using an oligonucleotide 
complementary to the mouse villin cDNA downstream of the ATG translation^ start 
site. The efficiency of the reaction was confirmed by primer extension of the mouse 

25 intestinal fatty acid binding protein (I-FABP) gene (fabpi) from the same RNA 
preparation (27). Analysis of the fabpi extension product on a sequencing gel by 
comparison with a sequence ladder (Fig. 1A) revealed a strong signal band of a size of 
81 bp as expected. The extension product of villin was 105 pb indicating that the 
transcriptiona] start site (an adenosine residue subsequently designed as nucleotide +1) 
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was 57 nucleotides upstream of the translation initiation codon of the murine viliin 
cDNA (Fig. IB). Comparison of the genomic sequence encompassing 9 kb upstream 
from the ATG initiation codon with the cDNA sequence, position of splice site 
consensus sequences in the 9 kb genomic sequence (Fig. IB) and determination of the 
transcription start site reveal that the mouse viliin gene has one transcription start site 
which is separated from the ATG iniaxion codon by a 5,5 kb intronic region (Fig,lC). 

DNase I-hypersensitive sites in the mouse viliin gene. 

To characterize the key regulatory regions involved in the specific control of viliin 
expression, we have mapped the DNase I-hypersensitive sites (31) in the mouse viliin 
gene (along a region extending 9 kb upstream and 4.4 kb downstream from the 
translation initiation codon, as represented in Fig + 2A). The chromatin form of the 
mouse viliin gene in different tissues (intestine, kidney, liver and spleen) was submitted 
to limited DNase I digestion and subsequently digested with the appropriate restriction 
enzymes. Accordingly, nuclei were isolated from intestine, kidney, liver and spleen. The 
Dnase I-digesied DNA was restricted by Blgll and hybridized with a 0.5 kb probe 
homologous to the 5 1 of the 7,5 kb Blgll fragment (Fig, IB). Two sets of Dnase I 
incubation-related fragments were detected, migrating at 5,5 and 2,7 kb, and 
corresponding to hypersensitive sites designated as HS I (located at approximately +5,5 
kb downstream from the transcription start (+1) site, just upstream the ATG initiation 
codon) and HS II (located at approximately +3 kb downstream from the (+1) site), 
respectively. HS I was observed in nuclei isolated from intestine, kidney and liver, 
whereas HS II was only present in intestinal tissue. No specific hypersensitive sites 
were detected in nucleic isolated from spleen. The presence and location of these 
hypersensitives sites were detected in nuclei isolated from spleen. The presence and 
location of these hypersensitives bands were confirmed by stripping and rehybridizxng 
the same blot with the 0.8 kb probe (Fig. 2A) homologous to the other end of the 7.5 kb 
BglO fragment (data not shown). Using BamHI digestion and the 0.5 kb probe (Fig. 
2C), five sets of DNase I-treated nuclei-related fragments were detected, migrating at 
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3.4 3 43, 4.7 and approximately 10 and 15 kb, corresponding to the hypersensitive sites 
H$ H (previously identified and consequently confirmed), HS III (located at 
approximately -0.5 kb upstream from the (+1) site), HS IV (located at approximately -1 
kb upstream the (+1) site), HS V (located at approximately -10 kb upstream from the 
(+1) site) and HS VI (located at approximately -15 kb upstream from ihe (+1) site), 
respectively. HS III was observed in nuclei isolated from both intestine and kidney, 
whereas HS IV was only present in intestinal tissue as HS II. The hypersensitive site HS 
V and HS VI were only present in liver tissue (in which villin is weakly expressed) and 
were located far upstream from the transcription start site in regions (i) which have not 
been subcloned and (ii) which could belong to an adjacent gene; for these reasons, these 
hepatic-specific hypersensitives sites were not analyzed further As for Bgffl digestion, 
no specific hypersensitive sites were detected in nuclei isolated from spleen. Using other 
independent restriction digestions (EcoRI and HindTCI) and the 0.5, 0.8 and 1.25 kb 
probes (Fig. 2A) to map the locations of the hypersensitive sites, similar results were 
obtained (data not shown). 

In conclusion, four major distinct DNase I-hypersensitives sites (HS I to HS JV) were 
shown to be present in the region extending from -1 kb to +5.5 kb in respect to the 
transcription start site (Fig. 3A) of the mouse villin gene. These sites were detected in 
intestine (HS I to HS IV), kidney (HS I and HS III) and liver (HS I), tissues in which 
villin is expressed, but they were not found in spleen, a tissue that does not produce 
villin- These findings correlate with the tissue-specific control of villin gene expression, 
and suggest that the putative critical regulatory elements lie within ihese regions. HS II 
and HS IV were only detected in intestine and are probably associated with tissue- 
specific transcription factors binding sites involved in the positive control of villin gene 
intestinal expression- 



Analysis of promoter activity by transient expression. 

To test the effects of the segments containing the DNase I-hypersensitives sites (Fig. 
3A) on transcriptional activity and to define more precisely the element(s) controlling 
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viliin gene expression in the intestine, segments were subcloned upstream of a 
promoterless Lac Z plasmid (coding for the bacterial (5-gaIactosidase gene with a 
nuclear localization signal sequence) (Fig. 3B). The resulting recombinant piasmids 
were tested by transient transfection assays in cultures cell lines. The construct pAl 
contained all the subcloned regions dowstream from the ATG initiation codon, 
encompassing the four DNase I-hypersensitive sites (HS I to HS IV) described above 
and the 5.5 kb intronic sequence, intron 1- Piasmids pA2 and pA3 were identical to pA2 
except for the presence of intestine-specific hypersensitive site HS II and intron 1 ? 
respectively, Plasmid pBl and plasmid pCl were similar to plasmid pAl t but lacked the 
regions extending from -480 bp to -3.5kb and -100 bp to -3.5 kb according to the 
transcription start site, respectively. Plasmid pC2 was identical to pA2, but lacked the 
region extending from -100 bp to -3.5 kb. Piasmids pB3 and pC3 were identical to pBl 
and pCl except for the presence of intron 1, respectively. The plasmid pDl was 
identical to pAl except for the presence of the transcription start site and the region 
extending upstream from this site. The plasmid pBasic, which does not contain a 
promoter or enhancer, and a pControl plasmid which possesses the SV40 promoter, 
were also tested in each experiment. Transient transfections were perfonned in the 
human colon enterocyte-like CaCo2 cell line and the pig kidney proximal tubules- 
derived LLCPK1 cell line which express viliin, and in kidney epithelial cells in which 
no viliin expression is detected, MDCK (a canine kidney distal tubules-derived cell 
line). Transcription from the viliin promoter was measured by assaying p-galactosidase 
activity in extracts made from the transfected cells, and the results were expressed as - 
fold induction over that of the promoterless vector, pBasic (Fig. 3C). High levels of p- 
galactosidaae activity in the pControl transfected cell lines (CaCo2 cells, 50*fold over 
that of pBasic; LLCPK1 cells, 98-fold) demonstrated the presence of efficient general 
transcription/translation machineries in these cells. Very low levels of £-galactosidase 
activity in pDl both transfected cells compared to pBasic transfected cells showed that 
the transcription start site was necessary for an efficient specific transcription of the 
reporter gene and that nonspecific transcription was not initiated elsewhere in the viliin 
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regulatory sequences. The construct pAl expressed the p-galactosidase gene at the 
highest level in CaCo2 cells (8-fold over pBasic) as compared with LLCPKl cells (L5- 
fold over pBasic) suggesting that the four DNase I-hypersensitives sites together with 
the first intron are necessary to promote efficiently transcription in cells of intesriiial 
origin. Deletion of the fragment containing the intestinal-specific hypersensitive site HS 
II (pA2) dramatically decreased p-galactosidase expression in CaCo2 cells (2-fold over 
pBasic) to about 25% of that of pAl, demonstrating that a major element which confers 
intestinal activity was confined within this fragment. Similar results were obtained when 
the region upstream from the transcription start site (encompassing HS III and HS IV) 
was almost wholly deleted with or without HS II (pCl and pC2, respectively). The 
deletion of the intronic region alone (pA3) 3 or in combination with deleted sequences 
upstream from the transcription start site (pB3 and pC3 extends only from -480 and - 
100 bp, respectively), affected to a lesser extent (3-galactosidase expression in the same 
intestinal cells (5.5-fold over pBasic), with a decrease to only about 65% of that of pAl, 
demonstrating that the regulatory elements which lay within 100 bp were sufficient to 
promote transcription in cultured cells. However the level of p-galactosidase activity 
increased strongly when the piasmids pA3 T pB3 and pC3 were transfected in LLCPKl 
cells (10, 44, and 45-fold over pBasic 3 respectively) showing that the absence of the first 
intron, in combination with the lack of intestine-specific HS IV, was able to promote 
transcription in a kidney cell line. This would suggest that negative elements which 
confer repression in kidney transcription are confined in these elements, 
To test specificity, the villin promoter-related constructs were transfected in MDCK 
cells, which do not express villin. After transfection, these cells showed only base-line 
levels of p-galactosidase activity when compared to pBasic-related activity (data not 
shown), demonstrating that the villin regulatory sequences were unable to promote 
efficient transcription in nonexpressing villin cells, and that consequently the expression 
of the reporter gene in CaCo2 and LLCPKl cells is specifically dependent upon these 
regulatory sequences. Taken together, these results from transient transfection of 
cultured cells demonstrate that (i) the mouse villin genomic sequence, extending from - 
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3.5 to +5.5 kb, directs specifically an efficient level expression of the p-galactosidase 
reporter gene in intestine-derived cells, (ii) this level is dramaticaly reduced when the 
intronic intestine-specific hypersensitive site HS II or the region upstream from the (+1) 
site is deleted, (iii) lack of the entire first intron seems to partially restore the intestine- 
related ability in promoting transcription, and (iv) lack of the entire first intron in 
combination with intestine-specific hypersensitive site HS IV is correlated with a strong 
increase of ability in promoting transcription in kidney-derived cells. 

Analysis of transgenes expression in mice : 

Since the -3.5 to +5.5 kb region of the mouse villin contained the enterocyte-like- 
specific promoter/enhancer activity in rransient-transfection assays, we examined the 
ability of this region to drive intestine-specific expression of the j3-galactosidase reporter 
gene in transgenic mice. The construct pAl was then prepare, after excision of the 
plasmid sequence, and injected into fertilized eggs. Five founder animals which 
contained the pAl construct as a transgene were obtained. The founder mice were 
analysed for mRNA reporter gene expression in several adult tissues by reverse- 
transcription PCR (RT-PCR) analysis. From the same cDNA samples, products 
encoding ^galactosidase, villin and TFIDD were analyzed. The PCR assays enabled 
only the defection of spliced transcribed mRNA, excluding that from genomic DNA 
itself, by means of an exon-connection strategy by combination of a 5' PCR primer from 
within the mouse villin promoter sequence just upstream of the splice donor site, and 
the 3' primers from within the p-galactosidase gene or the villin gene. For each founder, 
no reporter gene expression was detected in the tissues in which villin mRNAs were not 
detected using the PCR assay (Fig. 4). For all founder mice, the reporter gene 
transcription was detected along the cephalocaudal axis of the gut (duodenum, jejunum, 
ileum, proximal and distal colon) following the intestine-specific expression of the villin 
gene (Fig. 4). In the kidney, the transgene was only transcribed in one founder of five 
animals obtained (Fig. 4). TFED mRNA was present in all samples from tissues in 
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which the reporter gene expression could not be delected (Fig. 4), confirming the quality 
of RNA from these tissues. 

To examine the precise cellular distribution of transgene expression within the tissues, 
cryostat sections of small intestine, colon and kidney were prepared and subsequently 
stained for p-galactosidase enzyme activity. Immunofluorescence analysis of p- 
galactosidase expression was also performed on the same sections and similar results 
were obtained with the two procedures. Sections of small intestine, colon and kidney 
from non-transgenic animals exhibited no detectable ft-galactosidase activity. For four 
of five transgenic mice, a heterologous pattern of expression in small intestine and colon 
was observed in this assay. This heterogeneity was due to mosaicism since we examined 
founder animals. The expression was confined to the nucleus of the epithelial cells, as 
expected because the p-gaiactosidase gene contains a nuclear localization sequence 
signal (Fig, 5), The staining was detected by a stronger signal in the villi migrating cells 
when compared with the crypts cells, of both small intestine (Fig. 5A) and colon (Fig. 
5C) epithelium, thus confirming that the -3.5 to 4-5.5 kb region of the mouse villin gene 
is able to recapitulate precisely the cellular pattern of expression, along the crypt-villus 
differentiation axis, of the endogenous villin gene (17). A continuous labelling of all 
cells of the crypt (Fig. SB and D) was observed, suggesting the expression of the 
transgene in the stem cells (10). It is noted worthy that the intensity of the 
galactosidase staining was similar to that of intestinal sections from chimeric animals 
Owhich possess a (3-galactosidase gene integrated at the villin locus by homologous 
recombination procedure (32), indicating that the -3.5 to +5.5 kb region of the mouse 
villin gene was able to promote intestinal transcription as efficiently as the mouse villin 
gene itself. In the kidney of the founder mouse in which the transgene was detected by 
RT-PCR, the staining was only observed in the epithelial cells of the proximal tubules 
where the villin gene is expressed. The founder animals were able to transmit the 
transgene to their offspring with a similar pattern of (^galactosidase expression. In our 
attempt to direct an efficient expression of the reporter gene in the intestinal epithelium 
with shorter regulatory sequences, plasmids pA3, pB3 and pC3 were used to generate 
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transgenic mice, because these constructs display efficient levels of p-galactosidase 
activity in intestine-derived CaCo2 cells. The presence of the transgene assessed by p~ 
galactosidase staining and immunofluorescence procedures was observed in three of the 
four independent lines of pA3 transgenic mice generated. These three lines expressed 
5 the reporter gene only in the small intestine (in both the immature and differentiated 
epithelial cells along the crypt/villus axis), and all three lines failed to express the 
transgene in the other tissues tested, particularly note worthy is the lack of expression in 
the colon and the kidney (data not shown). These results demonstrate that (i) the 3.5 kb 
regulatory region upstream the transcription start site of the mouse villin gene is 

10 necessary and sufficient to sustain expression strictly in small intestine of transgenic 
mice, (ii) the first intron of the mouse villin gene is required for colon and kidney 

y expression in transgenic mice. Concerning the pB3 and pC3 transgenic mice, no 
transgene expression was observed in all tissues examined, including small intestine, 

5" colon and kidney. Thus, the key cis-acting elements of the villin gene required for 

15 intestinal and/or kidney-related expression of transgene(s) in transgenic mice are not 
located only within the region encompassing -480 bp upstream from the transcription 

ff start site, as observed in the cultured epithelial cells. 

J DISCUSSION 

20 In this report, we demonstrate that cis-acting sequences located within a 9kb region 
(-3.5 to +5.5 kb from the start site of transcription) of the mouse villin gene are 
sufficient to direct both correct tissue-specific and high expression level of the p» 
galactosidase reporter gene in transgenic mice, when compared with the endogenous 
gene (19), Reporter gene expression is detected in the whole intestinal tube and 

25 appropriately restricted to epithelial cells along the crypt-villus axis of both small 
intestine and colon. In addition, these regulatory elements can maintain a gradient of fa 
galactosidase gene expression from the crypts of LieberkQhn to tips of villi that 
precisely reproduce the gradient exhibited by the murine villin gene (17). Similarities 
between transgene and endogenous gene expression were also noticed as judged by a 
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comparison with the staining intensity of p-galactosidase activity in intestinal sections 
from our transgenic mice and mice in which the reporter gene has been inserted at the 
natural villin locus by homologous recombination (32). 

In the kidney, for only one animal of five analyzed, mouse reporter gene expression was 
restricted to epithelial cells of the proximal tubules recapitulating the villin expression 
pattern in this tissue. This suggests that transcriptional mechanisms specifying gene 
expression to intestine and kidney tissues are in the -3.5 to +5.5 kb region of the mouse 
villin gene, and that those related to kidney may be sensitive to positional effects. 
Indeed it is known that the transgene expression is dependent on site of chromosomal 
integration, and can be influenced by regulatory regions in the vicinity, presumably 
acting on chromatin conformation (33). The construct lacking entirely the first intron of 
5.5 kb, but which harboures 3.5 kb 5' to the start site of transcription of the mouse villin 
gene, placed in front of the (3-galacosidase gene, restricts the in vivo expression of the 
reporter, gene only into the epithelial cells along the crypt-villus axis of the small 
intestine- The extinction of the reporter gene expression in the kidney might be due to 
strong positional effects, as reported above, whereas the extinction related to the colon 
might be due to the absence of regulatory elements of the intron 1, such as the intestine- 
specific DNase I-hypersensitive site HS H. Constructs harbouring only the first 480 bp 
and 100 bp 5' to the start site of transcription, in combination with the lack of the first 
intron, placed in front of the {5-galacosidase gene, both failed to drive intestine-specific 
and kidney-specific expression of [3-galactosidase, suggesting that the intestine-specific 
DNase I-hypersensitive site HS IV localized just upstream from the 480 bp might pJay 
an important role in promoting reporter gene expression into the epithelial cells of the 
small intestine. Thus, distinct and separable regulatory elements in the mouse villin 
gene may direct transgene expression along the cephalocaudal axis of the gut the 
regulatory elements required for transgene expression in the small intestine might be 
localized in the 3.5 kb region (i.e. the HS IV site) upstream from the transcription start 
site, whereas those necessary for the colonic expression might be localized in the first 
intron (i.e. the HS H site). The inability of shorter regulatory sequences of the mouse 
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viliin gene to direct correct expression of the reporter gene in the whole intestine of 
transgenic mice might also be explained by spatial rearrangement of chromatine 
Structure due to the lack of the entire first intron. In fact, the results described here are 
reminiscent of those of the adenosine deaminase gene (34) and the aldolase Bgene (35) 
5 in which elements located in the first intron could be required for transgene expression 
in vivo, because they may contain cis-acting tissue-specific enhancer elements and/or 
elements involved in promoting decondensation of the chromatin structure, allowing the 
accessibiliy for transcription factors and RNA polymerase. 

To explain the discrepancy seen in the ability of the mouse viliin gene regulatory 
10 elements to promote transcription of the reporter gene in cell cultures versus transgenic 
^ animals, we may argue that the regulation of gene expression in the intestinal epithelium 
2 occurs as cells differentiate and migrate along the crypt-villus axis. This process 

depends on the contacts that these cells maintain with others neighboring cells on the 

2 one hand, and with the extracellular matrix on the other hand (36). Thus, an ex vivo 
95 system as the intestine-derived CaCo2 cell line used in the study, is limited by its weak 

ability in recapitulating the temporal and spatial complexities of this epithelium and 
zf emphasizes the importance to use in vivo models to define a function for specific 

3 regulatory sequences (37, 38). 

5 Previous studies carried out in transgenic mice to map transcriptional regulatoiy 
SO elements responsible for intestinal expression have been performed using cis-acting 
sequences of genes expressed in villus associated-enterocytes of small intestine (4, 5, 
38-40). In some of these cases, precocious activation in the crypts in combination with 
extended expression in the colon occurs in an inappropriate manner. Thus, to our 
knowledge, the 9kb regulatory region of the mouse viliin gene represents the only 
25 characterized cis-acting sequences reported today that allow the expression of a 
heterologous gene in small intestine and colon epithelial cells of transgenic mice 
reproducing with great fidelity the tissue-specific and cell-specific pattern of expression 
when compared with that of the endogenous gene itselfl In addition, the mice lines that 
drive a transgenic expression exclusively restricted to the intestinal mucosa could 
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already be studied after selection of those which will not display expression into the 
kidney because of the positional effects. 

The ability to tsrget genes of interest m transgenic mice following the villin restricted- 
pattern of expression, and particularly in the crypts stem cells enables to the 
5 development of targeted genes in animal models. Experimental mouse models 
reproducing several steps of human colorectal carcinogenesis (a possible genetic 
pathway has been proposed by Fearon and Vogelstein (41)) could for instance be 
obtained by efficiently targeting the associated oncogenes or mutated tumor suppressor 
genes to colonocytes using the villin regulatory region. Another use could lie in the 
10 establishment of new cell lines derived from the digestive tract by targeting a 
theimosensitive SV40 T antigen to the crypts resident-progenitors of intestinal cells, as 
used in other systems (42-44). 

Several of these applications are illustrated in the proposed protocols disclosed on 
Figure 8. 
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